Analysis of Household Pulse Survey Public-Use Microdata via Unit-Level Models for Informative Sampling

نویسندگان

چکیده

The Household Pulse Survey, recently released by the U.S. Census Bureau, gathers information about respondents’ experiences regarding employment status, food security, housing, physical and mental health, access to health care, education disruption. Design-based estimates are produced for all 50 states District of Columbia (DC), as well 15 Metropolitan Statistical Areas (MSAs). Using public-use microdata, this paper explores effectiveness using unit-level model-based estimators that incorporate spatial dependence Survey. In particular, we consider Bayesian hierarchical both a binomial multinomial response under informative sampling. Importantly, demonstrate these models can be easily estimated Hamiltonian Monte Carlo through Stan software package. doing so, readily implemented in production environment. For responses, an empirical simulation study is conducted, which compares non-spatial models. Finally, Survey micro-data, provide analysis design-based demonstrates reduction standard errors approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sampling with Synthesis: A New Approach for Releasing Public Use Census Microdata

Many statistical agencies disseminate samples of census microdata, i.e., data on individual records, to the public. Before releasing the microdata, agencies typically alter identifying or sensitive values to protect data subjects’ confidentiality, for example by coarsening, perturbing, or swapping data. These standard disclosure limitation techniques distort relationships and distributional fea...

متن کامل

Privacy Protection from Sampling and Perturbation in Survey Microdata

Statistical agencies release microdata from social surveys as public-use files after applying statistical disclosure limitation (SDL) techniques. Disclosure risk is typically assessed in terms of identification risk, where it is supposed that small counts on cross-classified identifying key variables, i.e., a key, could be used to make an identification and confidential information may be learn...

متن کامل

Parametric Distributions of Complex Survey Data under Informative Probability Sampling

The sample distribution is defined as the distribution of the sample measurements given the selected sample. Under informative sampling, this distribution is different from the corresponding population distribution, although for several examples the two distributions are shown to be in the same family and only differ in some or all the parameters. A general approach of approximating the margina...

متن کامل

Using CART to Generate Partially Synthetic, Public Use Microdata

To limit disclosure risks, one approach is to release partially synthetic, public use microdata sets. These comprise the units originally surveyed, but some collected values, for example sensitive values at high risk of disclosure or values of key identifiers, are replaced with multiple imputations. This article presents and evaluates the use of classification and regression trees to generate p...

متن کامل

The 2006 Earnings Public-Use Microdata File: an introduction.

This article introduces the 2006 Earnings Public-Use File (EPUF) and provides important background information on the file's data fields. The EPUF contains selected demographic and earnings information for 4.3 million individuals drawn from a 1-percent sample of all Social Security numbers issued before January 2007. The data file provides aggregate earnings for 1937 to 1950 and annual earnings...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Stats

سال: 2022

ISSN: ['2571-905X']

DOI: https://doi.org/10.3390/stats5010010